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SOME NEW METHODS MATRIX 


Columbia University 
INTRODUCTION 


The increased practical importance matrix calculation. This paper will 
concerned chiefly with methods finding the inverse matrix, and 
finding the latent roots and latent vectors, which are also known variety 
other names associated with particular applications, such principal axes 
geometry and mechanics, and principal components psychology. These two 
computational problems are extremely wide application. The first closely 
related the solution systems linear equations, which shall also con- 
sider. the method least squares the solution the normal equations 
best carried out with the help the inverse the matrix the coefficients, 
since least some the elements this inverse matrix are needed evaluating 
the results terms probability, vitally necessary step, and since the inverse 
matrix useful also various other ways, such altering the set predictors 
used regression equation. Modern statistics also utilizes quadratic and 
bilinear forms such the generalized Student ratio [15] for discriminating be- 
tween samples according multiple variates instead one only, the associated 
discriminant functions [10], the closely related figurative distance Mahalano- 
bis, Bose and Roy [5] and the critical statistic investigation Wald [28] 
the efficient classification individual into one two groups. these 
may very easily from the inverse matrix sums products, 
covariances correlations, from the principal components. Considera- 
tion the relations between two sets variates may utilize both the in- 
verse matrix and process resembling the principal 
nents. Similar computational problems arise applying sets numerous 
variates the contributions multivariate statistical analysis Fisher, 
Among the non-statistical applications the inverse matrix and latent roots 
and vectors are problems dynamics, both astronomy and airplane design 
[12], the analysis stresses and strains structures [26, 27], and electrical 
engineering problems [24]. 

Perhaps objection attempts statistical inference more common than 
that the variation this that relevant factor has been ignored. For example 
dealing with time series the need allowing for trend and seasonal variation, 
perhaps means sequence orthogonal polynomials for trend and 


Revision paper presented the Symposium Numerical held Dee. 
28, 1941 New York the Institute Mathematical Statistics and the American Sta- 
tistical Association with the cooperation the Committee Addresses Applied Mathe- 
the American Mathematical Society. For the program the Symposium see 
the Annals Mathematical Statistics for March, 1942, 103. 
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trigonometric functions for seasonal variation, well recognized. indeed 
desirable use regression equations with liberal number predictors 
eliminate spurious influences, well reduce the error variance, and likewise 
other statistical methods. But the computational difficulties the joint 
analysis the desired number variables have frequently seemed too formid- 
able. shall see how efficient techniques, conjunction with efficient 
machines, can far facilitate the use appropriate number variables 
reducing the labor modest dimensions. 

While the rise modern multivariate statistical theory has made available 
new exact tests hypotheses terms probability over wide range cases 
which multiple measurements are involved, such measurements have been 
accumulating large scale. many psychological, anthropometric, astro- 
nomical, meteorological and economic fields, actual measurements are available 
numbers variates far greater than have been regarded amenable, within 
practical limits, adequate treatment the numerical methods generally 
used. some instances the number cases which complete sets these 
variates are available also large. The 1931 census India included ex- 
tensive sample which fifty physical variates were measured for each individual. 
Karl Holzinger and his collaborators have worked out and circulated privately 
complete matrix correlations among mental tests. Astronomers have 
indicated the desirability recalculation the elements the solar system 
means gigantic least-square solution with 150 more unknowns, the 
same time deploring the seeming impossibility this ever being carried out. 
apply the methods modern theoretical statistics derive from such 
observations all the important information they contain enterprise whose 
feasibility depends new numerical methods. 

The chief computational problems, apart from those tabulating and provid- 
ing convenient approximations for the probability distributions, are (1) the cal- 
culation the many sums products pairs variates when large, and 
(2) operations the matrices these sums products such finding the 
inverse and the principal components. The first problem, which classical 
applications the method least squares long series has seemed the heavier, 
has sense been solved the use punched cards. card used for each 
case, and all variates are punched into it. running the cards repeatedly 
through machine wired each run select particular pair variates, 
multiply them together, and cumulate the products, this part the work may 
disposed with great speed. -The cost the machines does present limit 
the economical use this method rather large numbers, both variates 
and cases. This limit has recently been pushed upward the introduction 
improved multiplying calculators, with high-speed automatic multiplication 
and squaring locks. But these mechanical advances, combination with 
recent discoveries statistical theory, the increasingly felt need resort 
numerous variates, and the actual existence many cases data such mul- 
tiple variates, emphasize the need for rapid, economical and accurate calculations 
with matrices whose elements are sums products. 
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MATRIX CALCULATION 


Modern machine methods, especially those the punched-card type, but 
also those using machines such the Monroe, Marchant and Fridén, tend 
reduce the work formation sums products, comparison with other 
operations, such extent enhance the relative value methods 
which such calculation direct product-sums important. Thus products 
matrices are much simpler compute than inverses, and positive than negative 
powers. Indeed, powers and products matrices can computed means 
punched-card machines, and for large matrices this doubtless the most 
efficient procedure now available, though considerable rewiring needed. There 
also possibility, which does not seem too remote, development further 
devices this rewiring automatically. 


Iterative and direct methods. Partitioned matrices. later sections 
shall deal chiefly with certain iterative methods, giving particular attention 
the neglected question limits error stopping any point, and con- 
sidering the rate approach the desired solution. For finding the roots 
matrix and the associated vectors, the matrix has more than about four rows, 
seems clear that iterative method the most economical labor all but 
very special cases. the other hand the problems solving systems linear 
equations and finding the inverse matrix not usually yield readily 
iterative methods unless approximation the solution available begin 
with. This approximation not necessarily very close one, but must not 
too wild. may some cases obtained from general knowledge the 
subject. 

The Mallock electrical device [22] capable solving almost instantaneously 
ten linear equations ten unknowns with perhaps two significant digits each 
result, though this question accuracy remains elucidated. The com- 
bination this device with the iterative method Section below, and with 
the use partitioning for matrices more than ten rows, offers what seems 
present the best hope for the systematic inversion large matrices. Since 
only one the Mallock machines existence (it Cambridge, England), 
some adaptation the Doolittle related methods will ordinarily used. 
taking advantage the possibilities modern calculating machines ac- 
cumulating products reduce the amount writing required the Doolittle 
method, exceedingly compact and efficient methods have been developed for 
solving systems linear equations and for evaluating inverse matrices Dwyer 
9], who utilized the earlier work Waugh, Kurtz, Horst, Dunlap and Cure- 
ton cited him, and for solving systems linear equations, Crout [6]. 
Dwyer gives valuable bibliographies. 

some these methods, from general knowledge the subject, one 
may well obtain approximate solutions correct very small number decimal 
places, and then iteration get many more places are required, with 
labor far less than would necessary carry through from the beginning the 
requisite number places. Further applications iterative methods arise 
when least-square solution revised, either account new observa- 
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tions because errors discovered the original observations calculations, 
But however least-square calculation the evaluation any inverse matrix 
begins, and whatever intermediate steps are taken, seems advisable ter- 
minate with the method Section combines check the previous 
work, labor cost equivalent merely substituting the values found for the 
unknowns into the equations, with improvement accuracy and useful 
limit error for the unknowns. 

the inversion large matrices there are important possibilities the 
properties partitioning. For example, square matrix rows may 
partioned into four square matrices rows, and written 


this multiplied the right another partitioned square matrix 
rows which may written 


algebra except that care must used perform matrix multiplications cor- 
rect order. Thus 


where are square p-rowed matrices, the product 
identical with the result partitioning the product the two original 2p- 
rowed matrices. the second the inverse the first, this product the 
identical matrix. Consequently, the first matrix given, have for de- 
termining its inverse the four matrix equations 
where stands for the identical matrix rows and for the p-rowed matrix 
consisting entirely zeros. equations may solved just elementary 


These formulae call for inversion four p-rowed matrices, namely 

and Without changing the number such inversions may 

choose alternative sets matrices invert, with economy labor certain 

cases. For example, easy invert, may use for the expression 
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MATRIX CALCULATION 


The formulae and numerical work are further simplified the given matrix 
symmetric. Other modes partitioning are also possible, and may valuable 
various kinds numerical work. Another method obtaining the inverse 
matrix partitioning given Frazer, Duncan and Collar [12, pp. 112- 
118], who also give account general properties partitioned matrices. 
the treatment relations between two more sets variates [18, 31], 
partitioned matrices appear. 

The most efficient method function matrix will depend 
part what else For example, the latent roots and 
vectors are needed for any reason well the inverse matrix, better 
the former first, and then the determination the inverse matrix 
becomes trivial task; but the latent roots and vectors are not needed for some 
other purpose usually better not calculate them but use more direct 
method obtain the addition the inverse the determinant 
wanted, many consecutive powers matrix, matrix-multiplying 
machine considerably speedier than present procedures becomes available, 
method [3] based the Cayley-Hamilton theorem that matrix satisfies its 
own characteristic equation may recommended. 

Iterative methods have what Whittaker and Robinson [30] call the pleasing 
characteristic that mistakes not necessarily spoil the whole calculation, 
but tend corrected later stages. This course does not mean that there 
penalty for mistakes. They have obvious tendency prolong the 
number repetitions required, and repeated late stages may actually pre- 
vent realization substantially correct result. less obvious consequence 
mistakes near the termination iterative calculation that they tend 
vitiate any limits error that may derived, those that will found 
below. Great care should used insure accurate calculation especially 
the last stages any iterative process. 

insure accuracy even before the last stages, and therefore efficiency, 
check column consisting the sums the elements the rows matrices 
multiplied and added together may well carried along. multiplying two 
matrices only the check column the second factor used; multiplied 
row the first factor obtain the check column for the product. 
computer thoroughly experienced with matrix multiplication may dispense with 
the check column all stages but the last iterative process, relying the 
self-correcting property the process. 

simple but extremely valuable bit equipment matrix multiplication 
consists two plain cards, with re-entrant right angle cut out one both 
them symmetric matrices are getting the element 
the row and jth column the product, the ith row the first factor and the 
jth column the second should marked card beside, above, below it. 
writing symmetric matrix convenient omit the elements below the 
principal diagonal. The re-entrant right angle then utilized mark off the 
numbers belonging particular row. 
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report [13] certain iterative methods solving linear and other equations 
and calculating latent roots and vectors, with engineering applications, was 
certain problems psychology [16] the present author 1933 
iterative processes both for solving systems linear equations and for finding 
principal components, and later [17] showed how accelerate convergence 
principal components repeatedly squaring the matrix. Further acceleration 
convergence other devices has been discovered Aitken 
Geiringer has also discussed method solution equations involving iteration 
small groups unknowns [14]. The method Kelley and Salisbury 
should noted. has been used extensively psychologists. Definite 
limits error and measures rate convergence for this method are missing. 
Certain other iterative methods will discussed later sections. will ap- 
pear that the most-used methods are means the best. 

Questions regarding the probability matrix satisfying 
particular conditions computational significance may some cases il- 
luminated with the help the theory the variates random sample 
larger aggregate. This theory was outlined the latter part the paper [16]. 


AND INVERSE MATRICES 


Accuracy direct solution linear equations. question how many 
decimal places should retained the various stages least-square solution 
and other calculations involving linear equations has been puzzling one. 
has not generally been realized how rapidly errors resulting from rounding 
may accumulate the successive steps such procedures as, for example, the 
Doolittle method. this popular algorism for solving system equations 


tion alone accomplished calculating successively 
then 


and forth. Let suppose that each the and subject 
error concerning which known only that its absolute value does not exceed 
Thus they are given accurately decimal places only, have 
Let the actual errors represented and these are 
small estimate the error may obtained expanding Taylor 
series and retaining only the linear terms: 
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The closest upper bound for this error obtainable without special assumptions 
regarding the values the given quantities specified the inequality 


The a’s and g’s are often correlation coefficients. Any set normal equations 
least squares may reduced form which this the case, and this re- 
duction has considerable merits. The various correlation coefficients are fre- 
quently interest themselves, and their use the normal equations 
practically insures that all the quantities appearing any stage are the same 
order magnitude. This last very substantial advantage, partly because 
the check column which customarily carried along, which each entry 
the sum the other entries its row. Since the absolute value correlation 
coefficient less than unity, and since a;; becomes equal unity, the last in- 
equality gives this case 


and closer inequality appears possible. the same way find for this 
case which the a’s are correlation coefficients that 


Proceeding from these inequalities the same way, and neglecting the fact 
for the errors and the estimated upper bound with actual 
upper bound somewhat higher unless Continuing the same way 
find for and the estimated limit error with pos- 
<1. The rapidity with which this increases with caution against 
relying the results the Doolittle method other similar elimination 
methods with any moderate number decimal places when the number 
equations and unknowns all large. the limit error exceeds 
million times indicating that only one decimal place wanted the value 
the original correlations must utilized least seven decimals, even 
neglect the additional errors introduced dropping decimals beyond 
those retained the intermediate stages the calculation. The errors ac- 
cumulate further during the back solution, that all the unknowns are 
wanted with one-place accuracy necessary use the original correlations 
with substantially more than seven decimal places. For larger values the 
increase the error limit startling. Thus for (the number tests 
reported involved certain current procedure classifying military 
personnel) the limit error even for the first unknown evaluated repre- 
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senting loss about decimal places accuracy, while the correlations 
Holzinger’s 78-rowed matrix would need carried less than places 
insure even approximate accuracy the first decimal place one the 
regression coefficients formula derived least squares for predicting one 
his variates terms all the others. 

These high limits error may possibly reduced the following ways: 
(a) more exact study the error might made means terms the 
Taylor series orders higher than the first; (b) the positive definite character 
correlation matrix (or other matrix normal equations) might utilized 
attempt arrive lower limits error; instead considering the 
maximum possible error might depend some mutual cancellation dif- 
ferent errors and content ourselves with statements terms probability. 
The compounding different errors rounding, which may individually 
regarded having probability distribution uniform density over fixed 
range, quickly gives rise almost exactly normal distribution known 
mean and variance, that the probability approach attractive. However 
the limits error obtained this way with, for example, five per cent level 
probability greater error, though somewhat smaller than the limits asso- 
ciated with certainty, are disappointingly large. Investigations the types 
(a) and (b) have not been made; they would apparently very cumbersome, 
and (a) might have the effect increasing the error limits considered above 
instead cutting them down. Use the check column does not provide any 
safeguard against the errors rounding appearing the original correlations, 
though from the probability standpoint, carefully devised use the check 
column may mitigate the accumulation errors successive stages. 

control such errors reliance often placed substitution the solution 
obtained the given equations. This not completely satisfactory, since 
under some circumstances large errors the solution may yield only slight 
deviations the left from the right members the equations, and since some 
deviations must expected any case which only limited number 
decimals carried along. Moreover this substitution, even reveals the 
existence errors, does usually make clear what should done 
about them. recalculation larger number decimal places horribly 
laborious. There here distinct need using iterative process for im- 
proving the solution obtained, and setting definite limits for the errors. 


The classical iterative method. The iterative method which seems 
the oldest and the most used for solving systems linear equations, and which 
may like all other methods doing this applied find the inverse 
matrix, that Gauss and Seidel. seems also used the 
[26], which has been recommended engineers but lacks limits 
error and measures rate convergence. 

This classical method, starting with any assumed values for the unknowns, 
begins changing the value for the first unknown satisfy the 
equation; this possible the coefficient different from zero. The revised 
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MATRIX CALCULATION 


set trial values then further altered changing the second unknown 
satisfy the second equation. Then the third unknown altered that the 
third equation will satisfied, and forth. When all the unknowns have 
been thus altered the cycle may begun again, and repeated until the differ- 
ences between consecutive values each unknown become small enough 
indicate satisfactory convergence. The method converges the matrix 
the coefficients a;; positive definite, for the normal equations least 
squares, and also certain engineering applications [7, Moreover the 
character being positive definite insures that each a;; differs from zero, that 
the successive adjustments indicated are all actually possible. the published 
discussions, proofs convergence have sometimes been omitted, and some 
(e.g. [30], See. 130) the proofs are incomplete. Even the fuller proofs 
and [16] fail give explicit limits for the errors stopping any particular 
stage. But from the discussion [16, pp. 502, 504] easy see that positive 
numbers and exist, with such that the error the mth estimate 
less than This limit error diminishes geometric progression with 
iterations; hence the number decimal places accuracy increases 
approximately progression. The progression however irregular 
and the trial values may considerably. Numerical determination 
limits error does not appear with the method indicates 
that satisfactory only case really good approximation available 
begin with, spite its universal convergence. 


acceleration and extension the classical iteration. This classical 
scheme may improved the following way numerous cycles revision 
the trial values are expected needed for the requisite accuracy. The 
first step, consisting replacing the trial value 


where the symbol introduced for convenience order make these equations 
homogeneous, always equal unity. The matrix the transformation, 
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course singular. denote the one-column, 1)-rowed matrix 
the initial trial values, with unity the head the column, the column matrix 
the result this first operation, again with unity the head 


the column. The trial values obtained the second operation appear likewise 


where the matrices are the same simple character 
illustrated and This same result will obtained, because the as- 
sociative law matrix multiplication, first caleulate numerically the 
matrix 


The result complete cycle substitutions may written 


and then (Experience shows that computers need this point the 
caution that the matrices must arranged their proper good pro- 
cedure first form then multiply this the left, ete.). This 
requires rather more work than the original Gauss-Seidel scheme, and therefore 
not worth while only one cycle substitutions needed. 

The advantage lies the fact that may readily squared, and gives 
result equivalent that two full cycles iteration the Gauss-Seidel 
method. Furthermore, may squared give which may also 
squared, and soon. Obviously such squarings give matrix which, when mul- 
tiplied yields the same result complete cycles the original sub- 
stitutions. terms the number squarings the number decimal places 
accuracy tends increase geometrical instead arithmetic progression. 
This modification the classical method does not seem have been published 
heretofore, though both and the method Section have been use the 
author and his students since 1936. 

Fisher [11, Sec. 29] has introduced the valuable method finding the 
unknowns, with the same matrix coefficients, but different columns 
unknowns; these several columns unknowns are the elements the identical 
matrix. The technique carrying this out any the methods resembling 
that Doolittle simple extension involving replacement the right-hand 
members the equations 1’s and 0’s and carrying along such columns 
instead one while applying exactly the same linear operations the rows 
the older problem. This, like the problem solving linear equations, has 
been elegantly adapted efficient with modern machines Dwyer 
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(7, 9]. The foregoing iterative methods may also applied this case, but 
the matrix will different for the different columns. When the given matrix 
symmetric (as implied the positive definite character assumed 
the proofs convergence) the number iterations required generally cut 
down because the determination each column determines also the elements 
the corresponding row which lie other columns. Iteration groups 
[14] may well have place here. 

observation Aitken’s [1] noteworthy connection with the 
solution equations with non-symmetric matrix, and with the finding the 
inverse such matrix. Writing the equations the matrix form 
see that the solution also the solution the system (A’A)X 
where the transverse (also called the transpose conjugate) 
Evidently A’A and A’G can formed direct multiplications and additions, 
without divisions. Since A’A symmetric, any the methods for solving 
symmetric equations are applicable the new system. find the inverse 
may first find the inverse the symmetric matrix A’A and then postmultiply 


Roots, norms and convergence matrices. The norm matrix may 
defined the square root the sum the products its elements their 
complex conjugates, and denoted N(A). real and a;; the element 
the row and jth column, 


(6.1) N(A) 


This the same function which Wedderburn [29, 125] defines the absolute 
rather troublesome avoid confusing this with the determinant use 
the notation N(A), though the analogy with the ordinary absolute value 
quantity very suggestive connection with proofs convergence and limits 
error obtained means the “triangular inequalities” [25] 
gives different definition the absolute value the matrix the maximum 
the absolute values its roots. 

The triangular inequalities, whose proof easy with the help the Cauchy 
inequality, are: 


(6.2) N(A N(A) N(B), 
(6.3) N(AB) 
From the last follows that for any positive integer 
(6.4) 


Hence N(A) the limit N(A”) inereases zero. then follows 
that the limit itself zero, i.e. that each its elements approaches zero, 
because the definition the norm. 

The identical matrix rows, which shall denote simply has the 
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norm while matrix one with the quantity each element 
the principal diagonal and zeros elsewhere) has the norm norm 

The roots square matrix, also known the latent roots 
roots, are the values for which the determinant obtained 
subtracting from each element the principal diagonal expand- 
ing this determinant powers and using relation between roots and 
cients equation, evident that the sum the roots equals the sum 
the elements the principal diagonal. sum known the trace the 
matrix and denoted tr(4). Thus 


From the definitions the transverse and norm plain that 
(6.6) tr(AA’) 
real. 
any polynomial f(A) matrix whose roots are known [29, 


Consequently 


All the roots zero matrix are zero. But the fact that all the roots 
matrix are zero does not necessarily imply that the matrix zero; for example 
the roots 


(6.8) 


are both zero. But for real matrices the vanishing all the roots 
does imply the vanishing the matrix; for the sum the squares the elements 
symmetric matrix equals the sum squares the roots, A’, 
and (6.7), (6.6) and (6.1), 


Moreover, continuity considerations, sequence p-rowed symmetric 
matrices must approach zero all the roots approach zero, and conversely. 
From this evident that necessary and sufficient condition that 
approach zero increases, when symmetric, that all the roots 
less than unity. This provides sharper criterion convergence than the 
requirement that N(A) which sufficient but not necessary for conver- 
gence. The latter however far easier apply most numerical work, since 
far easier compute N(A) than the greatest root. easy 
set upper bound for various ways, which the notice 
that, (6.1), N(A) times the greatest absolute value any 
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element Also, the test terms the norm applicable 
well symmetric matrices. 

From these considerations regarding the convergence deduce once 
the following result. the norm square matrix less than unity, then all 
the roots are less than unity absolute value. The converse not true, the 
example (6.8) shows. 

For any real square matrix symmetric not, 


prove this, observe first that and consequently 
From (6.7) and (6.6) then have tr(AA’) 
This reasoning shows incidentally that real, though the indi- 
vidual roots may complex. 

Not only for investigating convergence, but also the important but neg- 
lected problems setting definite limits error after finite number steps, 
the norm extremely useful function. matrix computed with 
such that the error each element less than and the matrix 
errors, the requisite accuracy will according (6.1) attained when N(A) 
The definition and theorems regarding the norm are valid without any 
restriction square matrices, for which alone the roots are defined. For 
example, may use the norm derive inequality concerning the solution 
the system linear equations 


which may written matrix form where square matrix and 
and are matrices each one column and rows. From (6.3) find 


N(A)N(X), whence 
N(X) N(G)/N(A). 
shall now deduce result which seems new matrix theory and 


which shall later apply find limits error. any matrix such that 


holds, and may demonstrated exactly were ordinary scalar quantity. 
Suppose that N(A) Taking the norm and using (6.2), (6.3) and 
(6.4), have 


sion, obtain: 
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This holds for every positive integral value and therefore the limit when 
becomes infinite. Thus find that 


efficient inversion procedure. Let approximation the 
inverse matrix and consider the following sequence 
culate 


Let inquire the conditions under which the sequence matrices 
converges the maximum error that may committed stopping 
any stage, and the rate convergence. Suppose that good enough 
approximation make the roots the matrix 


(7.3) 


all iess than unity absolute value. Then increasing powers approach 
zero, and the convergence will follow from the relation 


which will now proved mathematical induction. From (7.1) and (7.3), 


that (7.4) verified Now assume (7.4) for particular value 
and substitute (7.2). This gives 


which being the same form (7.4) completes the induction. 

N(D) the roots are all less than unity absolute value, 
shown Sec. and the foregoing result holds. Assuming this true 
now derive upper bound for the error terms and 
cording (7.3), 


Hence, (7.4), 


Therefore, (6.3), (6.4) and (6.10), 
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This sets upper bound for the difference between each element and the 
corresponding element slightly looser but simpler limit may 
obtained from this terms the greatest absolute value any element 


The great value this method, whenever good enough initial approximation 
available make N(D) less than unity, the number decimal places 
sure accuracy increases geometric progression, rather than arithmetic 
progression with the usual methods. Consequently this method will always 
the most efficient sufficiently large number decimal places required. 
Moreover, limit can set advance for the number iterations that will 
required order insure any required degree accuracy. certainty 
correctness the sth decimal place required may choose that the 
right-hand member (7.6) less than terms logarithms the 
base the number decimal places whose accuracy assured iterations 
thus least 


These limits error can bettered after some iterations have actually been 
made. When becomes available may calculate AC,), which 
and generally enough smaller than make marked improvement. 

The elements the matrix errors will actually, course, smaller than 
the norm this matrix every practical case, ratio fluctuating about 
The limits obtained our formulae can reached only case the entire error 
the matrix concentrated one element, very unlikely event. Thus 
the limits given above will usually quite conservative. 

the iteration proceeds the elements the matrix D,, 
will diminish rapidly case convergence. For this reason may sometimes 
better calculate not directly from (7.2), but from the formula 


which the last term can regarded correction which will often 
very small. This method, however, lacks the self-checking feature, that 
its use the final stage dubious. 

This iterative process has been noticed previously [12, 120], but without 
limit error observation the geometric progression the number 
accurate digits. 

the initial approximation not good enough make N(D) may 
improved other methods, such those Sections and the point 
which this more rapid method becomes applicable. But some cases (e.g. 
the second example §8) the method converges even though N(D) 


f 
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may demonstrated later stage which the norm the matrix corre- 
sponding becomes numerically less than unity. 

For the mass least-square and other problems which the inverse 
matrix needed, the best procedure appears begin with one the methods 
described Dwyer [7, 9], carried small number decimal places, and 
then calculate from (7.3), step equivalent substituting the approximate 
solution obtained into the equations. may then evident glance that 
the norm small that the method the present section will converge 
rapidly give many more places desired. N(D) too large for this, 
and gross errors have been eliminated, there choice between recalculation 
from the beginning, the classical iterative process, and the acceleration this 
process matrix-squaring, with perhaps some iteration small groups. The 
choice will depend partly how much the elements need reduced. 
The classical iteration (or sometimes the process this section) appropriate 
for correcting slight excess N(D) over unity, its matrix-squaring extension 
for larger alterations. 


symmetric, naturally take symmetric matrix, and this will 
follow from the last equation that negative definite, negative semi- 
definite. Consequently the diagonal elements tend underestimate the 
corresponding elements and never exceed them. Furthermore, the 
value quadratic form whose matrix will least great the 
estimate based The squares, both the multiple correlation 
coefficient and the generalized Student ratio [15], can expressed such 
quadratic forms. Hence both these statistics are slightly underestimated when 
used place the true matrix coefficients. Later approximations 
not change the signs these biases, though they make their magnitudes 
approach zero case the conditions for convergence are satisfied, and definite 


limits converging zero are easily found for them such cases from the results 
above. 


Illustrations and further comments. shall indicate symmetric matrices 
writing only the elements and above the principal diagonals. 
illustrate various methods Dwyer [7] has evaluated the inverse 
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1.0 
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1.3989 .2732 
1.6940 


the accuracy the calculation had been only such insure correctness 
the first decimal place the approximation would have been 


1.4 
1.7 


easy mental arithmetic alone, without the use machine side 
calculations, see that 


and further that N(D) +/.0052 .072. This much less than unity 
that the iteration process will converge rapidly. matter fact, 
without determining the the squares the elements could 
have observed glance that N(D) must less than four times the greatest 
absolute value element, and thus have value less than .16. the same 
way seen less than 8.4; actually equals 3.8588. The latter 
value, with .072, substituted (7.5) gives for the norms the 


This promises merely that after one application the iterative process the 
results will accurate one decimal place, which know already but might 
not have known for sure such case; that second iteration will give results 
accurate three places, and that third will give results accurate about 
eight places. These estimates will however improved after actually com- 
puting This may well done (7.1) machine available; otherwise, 
and almost easily, (7.8) obtain 


1.398 

1.692 
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and 3.8163. (We have now passed beyond the stage easy mental 
calculation, but might alternatively use the easy upper bound 8.28 for 
obtained before.) shall use this value instead N(Co) (7.5) and 
the same time use for the value where 

This most easily found from from which may written down directly 
mental calculation: 


The norm seen the crude method less than .0072, and actually 
.003212. Taking the latter value for have, similarly (7.5), 
Thus, 
.0246, 


.000 0789, 
.000 000 000 


The reduction these limits error due the difference between 
and N(D*) .003212. 


1.3989056 .2732260 
1.6939852 


From this 


agreeing with the value obtained from the formula and finally 


2.071 038 458 .191 256831 .775 956 284 928 962 
1.284 153005 .218579235 257 

1.398 907 104 .273 224 045 

1.693 989 071 


which shown above correct least eight decimal places, and doubtless 
more, each element. The estimate obtained Dwyer several 
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direct methods four places corroborated this result excepting for 
slight error the element his first row and third column. 
(ii) Suppose that the approximation the foregoing example had been even 
with determination the elements only the nearest integer. 
This would give 
tly 0-1 


than unity, and not clear this stage whether the iterative process have 
been using will converge not. But upon computing 


find that N(D*) .7806, and since this less than unity are 
assured that the process will converge. may write immediately, without 
use machine written side 


1.0 


—.2 
The sum the squares the elements 1.51, that the norm greater 
§ 


Utilizing the value already determined, readily find 


1.558 


From this point machine needed for efficiency. The next step 
methods may used check. The result is: 


808 730 1094 1330 


616 152 1696 
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may now consider the accuracy further approximations, inserting 


N(D*) .4119. Thus 


Because the roughness the initial approximation this case the con- 
vergence rather slow first, but later much accelerated. far the 
limits found above show, five iterations are necessary sure even approxi- 
mate two-place accuracy the results (somewhat better limits could ob- 
tained after actually still better ones from but the 
sixth iteration gives results sure accurate nearly five places. Perhaps 
the best treatment numerical case this kind work out the solution 
Dwyer’s method two, three four places, and then apply the iterative 
process once, and many more times necessary obtain the required 
accuracy. 

The final step should, for the sake checking, calculation from 

Upon observing that N(D) might have used the Seidel process 
improve each row This process however extremely slow, and the 
present example markedly inferior that used above. 

(iii) start from the result which Dwyer gives four decimal places 
obtain 


find 3.8188, and putting N(D) .00085 have from (7.5), 


000 000 0004. 


Certain other methods successive approximation. class methods 
for solving linear equations, which may extended find the inverse 
matrix, given Frazer, Duncan and Collar pp. 132-133], generalizing 
method this method the matrix the coefficients the 
linear equations, the matrix inverted, written the sum easily 
inverted matrix for example diagonal triangular matrix, and another 
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matrix Then 
where the latent roots are all less than unity absolute 
value, and N(f) the series 
converges. solve the equations where and are column vec- 
tors (i.e. matrices one column) determine 
where The method Frazer, Duncan and Collar calculate 
the successive vectors 
clear that 


The error therefore the vector 


and (6.10), 


where the number unknowns; and individual unknown will have 
error greater than 
Convergence this method, existent, may accelerated matrix- 
squaring. Indeed, upon calculating turn f*, repeated squar- 
ings, need only work with the sequence 


omitting the intermediate approximations. This will worth while for solving 
single set equations only case such great accuracy required de- 
mand the use rather high powers Each squaring consists the 
formation sums products, that determination of, say, this 
method requires such sums after has been determined, addition the 
involved finding X,, after the squarings. the 
method Frazer, and Collar the corresponding number sums 
that the matrix-squaring justified only for six fewer unknowns unless 
larger number terms required. Furthermore, increasingly high powers 


matrix, useful, need usually expressed with more and more signifi- 
cant digits. 
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more than one system equations with the same matrix solved, 
these methods have the advantage that the same matrix can used for all 
the vectors right-hand members. such cases the value matrix-squar- 
ing enhanced comparison with that which only single system equa- 
systems which the several column vectors together constitute the identical 
matrix. more than these systems equations are solved best 
find and then form the various solutions from the columns 
right-hand members. 

are all rational functions single matrix. difficult cases this may 
sionally provide useful check. 

This method differs from the other iterative methods with which are 
concerned that errors calculation are not automatically corrected it. 
This serious disadvantage, especially for the inexperienced computer, and 
makes desirable the careful maintenance check column. the other hand, 
does not require any preliminary knowledge the solution. Indeed, should 
classified rather with the direct than with the iterative procedures this 
account. 

The critical element determining the success this method the possi- 
bility impossibility finding suitable matrices and such that can 
calculated easily, and such that the elements are sufficiently 
small make the roots all numerically less than unity. Morris uses for the 
matrix derived from replacing all the elements above the principal diagonal 
zeros. This insures that the corresponding positions are also 
zeros. The other elements are then determined fairly easily. the 
non-diagonal elements which appear are sufficiently small, this fact 
will insure small enough elements make convergence rapid. 

second method, given Frazer, and Collar, chooses for 
diagonal matrix (one having only zero elements except the principal diagonal), 
simply the unit matrix. This choice reduces the labor inversion 
minimum. Successful convergence will take place when the non-diagonal 
elements are sufficiently small comparison with those the diagonal, 
taken the diagonal matrix containing the diagonal elements 

third method which may useful certain cases, particularly when some 
but not all the unknowns are required, the following. Let partitioned: 


where and are square submatrices which, being lower order than are 
more easily inverted. Let and the correspondingly partitioned matrices 
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only the first the unknowns are required, and may taken 
and and and consist the remaining rows, then and 
Then, case convergence, the first rows the solution 
are given 


Convergence the correct values assured here the norm any power 
less than unity, true and only the absolute values all the roots 
are less than unity. This easily seen true, since increases 


10. simple iterative method solving equations. entirely different 
method, whose convergence independent the initial trial values, the 
following. solve for the column vector the equation may 
start with arbitrary column trial values and constant and 


equal obviously the desired solution. Otherwise there 
error 


This converges zero increases provided the latent roots are 
all less than unity absolute has only real roots this equivalent 
requiring that they all between and 2/h. particular, correla- 
tion matrix, its roots are all real and positive. Since their sum 
where the number rows, all roots lie between and 
order make the error diminish fast possible, take large con- 
sistent with convergence. some cases lower limit than will known for 


the greatest root and then value than 2/p can taken for 
limit error obviously set 


This method can course applied find the inverse matrix. 
have for example, 
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The last term will approach zero case convergence, and may omitted 
this type calculation. 

Thus accelerated, the method gives decimal places accuracy increasing 
geometric instead arithmetic progression, and remarkably simple and 
straightforward. its best when the roots are known closely 
clustered about unity. criterion this =(A; shall have small 
value, where the mean the roots \;. This sum squares equals 


that 
Smallness this quantity favorable not only this iterative method but 
also those §§4 and 


11. Use the characteristic equation for inversion and for finding deter- 
minants. method differing greatly from the others based the Cayley- 


Hamilton theorem that every matrix satisfies its own characteristic equation 


and also equals the sum the r-rowed principal minors the matrix Sub- 
stituting for which the Cayley-Hamilton theorem legitimate, multi- 
plying and transposing term, yields 


This equation provides direct method calculating soon the ele- 
mentary functions the roots have been evaluated. 
This evaluation may accomplished means Newton’s identities [4, 243] 


connecting the elementary symmetric functions with the power-sums. 
the sum the rth powers of.the roots, these formulae give: 
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from the diagonals these matrices, since tr(A’), then obtain the ele- 
mentary symmetric functions the roots from Newton’s formulae, 
and substitute these the right-hand member (11.1). then only 
necessary find and divide which equals the determinant For 
this, and for checking the calculations, there choice methods. may 
find the diagonal A”, without troubling compute the whole this matrix, 
from the product and also, provide comprehensive check, from 
possibly from the product two powers exponents approxi- 
mating p/2. The sum these diagonal elements which may 
substituted the last the Newton formulae above with the quantities pre- 
which computed (11.1), obthin the determinant 

The total number multiplications, divisions, and additions distinctly 
greater this method than efficient direct methods such that Dwyer 
the other hand, this method straightforward and easily checked; 
the divisions involved are the simplest character, consisting only the 
adjoint matrix one number; and for large matrices ideally adapted for 
matrix multiplication means punched cards. further very important 
advantage this characteristic function method that yields considerable 
additional information by-product. Not only the determinant the 
matrix but the sums the principal minors each order are determined. 
Moreover the characteristic equation, whose coefficients would exceedingly 
difficult compute directly from definitions for large matrix, this method 
made available for the study the latent roots, which have great interest 
themselves for numerous purposes. 

The characteristic function method applicable whether symmetric 
not. the same true each the other matrices appearing 
the that necessary write only about half the elements. 

illustration using symmetric matrix has been given Bingham 
the illustration below the matrix not symmetric and has complex 
double roots and non-linear elementary divisors, that evaluation the roots 
iterative methods, though possible, would very slow and laborious, 
shown Aitken indeed the same example used Aitken this 
discussion. But should noted that the associated latent vectors, which 
are determined along with the roots the iterative processes, require the 
solution sets linear equations the roots are found directly solving 
the characteristic equation. 
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Then 
—40 105 —40 —617 —380 499 256 
—76 —43 —260 —189 —316 355 280 
—61 —464 —300 —136 439 292 
—40 110 —14 —40 —617 499 256 
—1342 2444 2006 
944 522 503 


—1312 2444 1991 


From the diagonals these matrices, 
Calculating the sum the diagonal elements only (on machine, without 


listing them separately) from and also, check, from find 
3185. Newton’s formulae then give 


the last value being that the determinant readily find from 


—207 111 171 
—315 195 270 


far, all results this method are exact, but the division 225 introduces 
recurring decimals and therefore limited validity for the form 


1.8400 —.2356 —.2311 


The characteristic equation 
may this case solved readily, since 


12. Direct and iterative methods. the latent roots but not the latent 
vectors matrix are desired, for example preliminary study vibra- 
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tions machinery being designed, where the important question whether any 
root has positive real part, only necessary find the characteristic equation 
and work with the methods the theory equations. The coefficients 
the characteristic equation are the sums the principal minors 
p), and are expeditiously found directly from this definition for 
matrices four fewer rows. For large matrices, however, the calculation 
many large overlapping determinants wasteful effort, since many vir- 
tually equivalent calculations must done calculation 
determinants great many situations, including the solution linear equa- 
tions, open this objection. The methods §11 the characteristic 
function manner which, for large matrices, appears the best available, 
excepting perhaps the new method Samuelson [25a]. 

When, commonly the case, the latént vectors are desired, straight- 
forward calculation directly from the definitions would require not only setting 
and solving the characteristic equation, but also the solution, the case 
each root, the set linear equations unknowns whose matrix obtained 
from the characteristic matrix substituting the particular root for 
this solution linear equations that aggravates greatly the computational 
labor when direct methods are used. 

ingenious method has been used Fisher [11, pp. 299 ff.]. Starting 
with four-rowed determinant whose elements are linear functions un- 
known Fisher the value the determinant for selected values 
and then interpolation using divided differences finds the largest value 
making the determinant zero. The point the divided difference method 
that avoids the direct calculation the determinant for more than few 
values replacing essentially calculation the fourth-degree poly- 
nomial from its differences and using the fact that the fourth divided dif- 
ferences are constant. The linear equations are then solved direct manner. 
applied large matrices this would very laborious, but compares fa- 
vorably with directly from definitions the manner suggested 
reading books algebra and solid geometry. But even with large 
matrices Fisher’s method may perhaps the best certain cases, e.g. all 
that desired the root median absolute value and this root real, 
desired find few real roots that are close together, with numerous others 
greater and another numerous group less than these. This because the itera- 
tive methods give the real roots the order their absolute values, beginning 
with the greatest, but with the possibility obtaining them the opposite 
order first inverting the matrix. The Mallock electrical device [22] may 
used calculate determinants, and thus apply this method. 

and are p-rowed matrices and non-singular, the determinantal 
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any problem this type reducible that finding latent roots and vectors, 
upon calculating any method and multiplying either order 

The fundamental iterative method for finding latent roots and vectors 
begins with arbitrary matrix single column. This column vector 
though unlikely, the elements are proportional those they con- 
stitute one the latent vectors and the factor proportionality the 
corresponding root, for then and are solutions the matrix equation 
should observed that the latent vector determined only 
within arbitrary scalar factor proportionality, though may some- 
times find convenient normalize the vector choosing the factor such 
way that the sum the squares the elements, which equals the square 
the norm, unity. 

they are divided by, say, their respective first elements, then the other 
elements will (in the cases greatest practical importance) gradually approach 
stable values which will determine one the latent vectors, while the suc- 
cessive factors proportionality will approach the corresponding root. The 
convergence this process however apt rather slow. Fortunately 
there are several known ways accelerating it. 

Matrix-squaring the first these methods accelerating convergence 
19]. clear that Consequently one application the 
iterative process with equivalent iterations with relatively 
easy square and then repeated squarings form ete. The 
economical limit this process determined partly the necessity re- 
taining more and more digits the successively higher powers, but 
point not yet determined exactly presents very great advantages. For pro- 
ceeding the determination latent roots other than the maximum absolute 
value, with their associated vectors, this method lends itself further short- 
cuts [17, 2], which seem give advantage over older method [13]. 

Another method accelerating convergence, introduced Aitken, 
and referred him the uses the ratio element 


which converges rapidly toward the root greatest absolute value. 
constant subtracted from all three the quantities 1), and 
before computing the foregoing function the result unchanged. This fact 
reduces greatly the computational labor, since the decimal places already 
determined are common all three. 

symmetric and form the scalar products with itself 
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and with have 


The ratio these two scalars gives estimate which the basis the 
ratios consecutive elements given place the trial vectors would not 
reached until later stage convergence, corresponding fact twice 
many iterations. Aitken has pointed out the great value this procedure for 
finding the root (but not the latent vector), and has extended the idea asym- 
metric matrices, where there complication because the existence two 
latent vectors for each root, one determined premultiplying the other 
A’. 

The comprehensive paper [2] Aitken gives extremely valuable account 
the whole problem and processes finding the latent roots and vectors, in- 
cluding survey the various cases arising when there are multiple roots, 
complex roots, and non-linear elementary divisors. This paper should studied 
anyone with any substantial numerical problem this kind. 

method using rotations two variables time has been devised 
Kelley [21]. 

The remainder this paper will concerned with some results, believed 
new, which useful upper limits can set for the errors the results 
yielded iteration for latent roots and vectors symmetric matrix. 
find such limits error for asymmetric matrices appears much more 
difficult and yet unsolved problem. 


13. Accuracy iteration with symmetric matrices. symmetric, 
most statistical problems (though with some exceptions, [18]), the 
roots are all real and the elementary divisors are linear. Moreover there exist 
orthogonal matrix and diagonal matrix 


such that 
(13.1) 
Since orthogonal, HH’ and therefore 


may associate with the successive trial vectors the 


vectors then From these equations and the second 
(13.2) clear that 
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Y; 

and let 

(13.4) 


has negative root this fact will become evident after certain stage 
the iteration used obtain this root alternation sign the numbers 
any one position consecutive trial vectors. However which pointed 
out §12 may well calculated anyhow, has only positive roots, which are 
the squares the roots and has the same latent vectors Hence 
shall have results sufficient generality for real symmetric matrices as- 
sume that all roots the matrix with which work are positive zero, 
that positive definite semi-definite. Let choose the notation that 


Hence, (13.4), 


(13.5) 
known [23] that are any positive numbers, the 
function 


monotonically increases. Hence the best these lower bounds for 
that corresponding the least value that can used, namely 
Consequently the lower bound for given 


(13.6) 


Vit 


From (13.4) easily seen that this lower bound approaches when increases, 
provided 
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upper bound for available from the fact that the sum the powers 
the roots the trace A‘. Since assume all this gives 


(tr 


That this upper limit converges when increases easily seen from (6.7) 
upon consideration log 

lower limit alternative that (13.6) also available from and 
likewise converges Indeed, since the greatest root, have 


(tr 


now seek limits accuracy for the latent vector corresponding and 
then lim Y*, where the form has its ith 
element 


t 
lim 


this case. multiple root, the limit will depend the initial 
values 


t 


t 

Qot 

therefore approaches zero approaches unity. shall seek for lower 
limit approaching unity increases. 
Let now put 
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k 
Qot 


This unfortunately not very useful lower bound for since approaches 
zero, not unity, increases. 
more satisfactory result obtained follows. Let Then 


Pp 


taking the positive values With the positive weights, probabilities, 
The kth moment this distribution about particular the 
first moment and evidently least equal which the least the 


the Tchebychef-Bienaymé inequality, 


and therefore 
2 lt 


provided large enough that This lower bound approaches 
unity, desired, when increases. 


provided The left member the correlation with that one 
the k-parameter family vectors corresponding the multiple root 
for which the correlation maximum. 

order utilize these results need lower bound for 72, for 
case this requires upper bound for upper bound may 
found the next stage through working with the reduced 


Since have arrived definite lower limit for which approaches unity 
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the iterative process proceeds, and since have found for upper and 
lower bounds converging it, solution has been found for the troublesome 
problem the degree accuracy stopping any stage the iteration for 
finding the greatest root and the associated latent vector. would possible 
find from these results appropriate inequalities for and then 
repetition the above arguments, for and the second latent vector; and then 
likewise for the second reduced matrix and the further roots, vectors, and 
reduced matrices this order. These steps may well taken the 
‘computer who has mastered the above argument connection with numerical 
example. 
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SOLUTIONS THE BEHRENS-FISHER PROBLEM, 
BASED THE 


Henry ScHEFFE 


Princeton University 


The Problem. The problem [1, the interval estimation’ the dif- 
ference the means two normal populations when the ratio the variances 
the populations unknown. The reader who wishes see the present 
solution before considering theoretical details will find recapitulated the 
Summary the end and will want refer the following notation: 
and Yn) are random samples from normal popu- 
lations with means and and variances and respectively. 
the order observation, else have been randomized. 

Recently Neyman [3] has called attention solution which shall desig- 
nate (B), and which special case unpublished solution 
will simpler describe (B) later, but mention now that has the 
following advantages: its validity does not depend the values unknown 
parameters, the required computations are simple, and only existing 
tables are needed, the widely available Fisher unsatisfactory 
aspect (B) that when the sample sizes are unequal, the variates 
are completely discarded. The solution below shares with (B) the advan- 
tages indeed, identical with (B) when but when 
free from the above objection. 

Simple Solution. begin with simple restricted approach; later 
will review the result from somewhat broader random variables 
are independently normally distributed with mean and vari- 
ance and and are defined from 


then and are independently distributed; the former 
normal variable with zero mean and unit variance; the latter, 
where generic notation for random variable distributed according 
the with degrees freedom. The quotient 


treat the problem from the standpoint confidence intervals, rather than signifi- 
cance tests, since when the former are available for whole class the latter, namely 
for any hypothesis for all Furthermore, questions the existence 
tests and confidence intervals are closely related [5a]. 

How far Bartlett followed the path this paper not clear from the brief mention 
his results Welch [4], except that did establish the sufficiency certain ortho- 
gonality conditions. 
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where denotes generically variable having the with degrees 
freedom. Define from 


Then set confidence intervals for with confidence coefficient 
Denote the expected length the confidence interval (2), 
(3) 
where 


The symmetrical choice (1) the limits the ¢-distribution minimizes 
consider using connection with the confidence intervals (2) linear 
functions 


The variables have multivariate normal distribution. Necessary and 

sufficient conditions that the all have the same mean equal variances 
and zero covariances, are easily found 


n n 


satisfying (5), and for which minimum. The minimum value m/n: 
this easily proved the use vector algebra. 

note the transpose matrix prime. Then the conditions (5) read 


orthogonal vectors that the complete set satisfies the 
second group conditions this set basis n-space, 


k=1 
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where the are scalars. Now 


and hence m/n. the other hand this lower bound for may at- 
tained taking any set orthogonal vectors with norms 
m/n, that is, and rotating them that their equal angles vector 
For 


that equations (6) are satisfied with m/n. 
especially neat solution this problem was obtained the 
above method; its validity may easily verified 


j>m. 


where 


= 34 
may now write (2) 


The solution (B) mentioned the beginning, consists taking 
(4), that the conditions (5) are satisfied with Hence for both (B) 
and (9) the expected length the confidence interval given (3), but with 


Obvious modifications (9) will make suitable for estimation. 


(3). 
Then 
and 
Oo, 
| : : 
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More General Solutions. now generalize our approach the following 
extent: Let linear form and quadratic form the variates 
Yn, With coefficients independent the parameters 
(i. p.). for some constant p., and some function the parameters, 
h(L and are independently distributed, the former according the 
normal law with zero mean and unit variance, the latter according the 
with degrees freedom, then the quotient 


will have the with degrees freedom, matter what the 
values the parameters. 
note that necessarily then 


(11) E(L) 

The (10) leads the confidence intervals 


where defined (1), and the confidence coefficient Proceeding 
toward (3), find that the expected length (13) 


(14) 


Since are p., follows from (11) and (15) that 


from (12); thus (14) may written 

From (18) also have 

(20) 
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i=l i=l 
are 

Lemma. propose prove next that the maximum value 
that say, impossible obtain for quotient (10) with 
more than degrees freedom. For this need lemma the effect 
that certain well known sufficient conditions for quadratic form have 
are also necessary. 

9 


shorten the notation, write 
Let 


where the indices and range from throughout. Then isi. p., 
and 


where 
From (17) 


The moment-generating function 


There exists non-singular linear transformation from the ¢’s v’s such that 
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Then 


Hence 
(24) 


and the must equal while the remaining vanish. generality 
lost assuming 


k 
1= 


where p., and the are linear combinations the such that 
(27) 


That the conditions (26) are necessary* for constitutes the desired 
lemma. 


Maximum Number Degrees Freedom. have seen that the 
(26) must the form 


substitute (28) and (20) into (27) and write the result matrix form, 


. . k k 
ever new matrix introduced, superscript indicates that has rows 


have incidentally proved sufficiency. 


5 
‘ 
m n 
Se 
| 
| 


IWS 


BEHRENS-FISHER PROBLEM 


columns. Now knew that AA’ and BB’ were p., then could 
equate coefficients and (29) and immediately draw the desired conclu- 
But that BB’ are not obvious, since this need not 
true and However, know that the matrices 


Multiplying (29) the left and the right obtain 


(30) must hold identically Since the coefficients are now p., 
may equate them, hence GG’ Similarly multiplying (29) and 
get Now’ for any matrix rank rank M’M rank 
such that 


where the matrix the first whose diagonal elements are unity and 
all other elements zero. Let Then 


from (31). Likewise can write 


where 


simple proof [5b] this useful theorem the following: Let rank rank 
the rank the product cannot exceed the rank contains 
independent column vectors; the Grammian matrix these vectors non-singular and 
are Grammian matrices (which always have non-negative determinants), hence 
always positive——we use this below. 


Since 
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Then from (32), (33), 
(34) 
Substituting (34) (29), get 


and multiplying the left, the right, 


Similarly find 

(37) 

From (36), VV’ I,. now becomes 


Multiplication (88) the right gives 


Hence U,, therefore and putting this back into (38) 


Minimum Expected Length Confidence Intervals. now point out 
that all confidence intervals (13) with the confidence intervals (9) have 
the minimum expected length. Recalling that the a;, (19) are subject 
the conditions (16), easily find 


t=1 


From (39) and (19) have 


and referring the statement the end section the property (9) asserted 
above now obvious. 


Asymptotic Shortness Confidence Intervals. conclusion wish 
compare our results with the case where the ratio the variances, 
known. 
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then S,/v are mutually independently distributed, the first 
normally with zero mean and unit variance, and Sy/v 
Hence 


TABLE 


TABLE 
Values fore .99 


This relation yields the confidence intervals 


where the confidence coefficient again The confidence intervals (40) are 
known highly efficient; for instance they are the shortest unbiased 
type [5a]. their expected length 


The ratio for (9) for (40) thus 


m 
1.15 1.20 1.23 1.25 1.28 
1.05 1.07 1.09 1.11 
1.05 1.06 1.09 
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siderably >1, but this partly offset approaching its limiting value 
from below that the ratio the c’s <1. The behaviour for finite 
indicated Tables and II. Table (II) tells for example that with 
10, and .95 (.99), the expected length the confidence intervals (9) 
most per cent (20%) longer than that the optimum confidence inter- 
vals (40) available when the ratio known. While may conclude from 
that our solution (9) asymptotically extremely efficient, 
cannot conclude from Tables that for small (9) inefficient, since 
not know what the lengthening effect the extra nuisance parameter the 
Behrens-Fisher problem would confidence intervals. 


Summary. the terminology the first paragraphs sections and 
have proved that there not exist linear form and quadratic form 
the observations such that the quotient (10) will have the (for 
all values the parameters) with more than degrees freedom. 
have further shown that all confidence intervals (13) based the 
tion with degrees freedom, and with confidence coefficient (9) has 
the minimum expected length. The quantities needed apply our solution 
(9) are given (1), (7) and comparing this solution with 
known highly efficient solution for the case when the ratio the population 
variances known, has been possible show that least asymptotically 
our confidence intervals (9) are very short. 
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EXTENSION WILKS’ METHOD FOR SETTING TOLERANCE 
LIMITS 


ABRAHAM WALD 


Columbia University 


Introduction. Let random variable and let f(x) its probability 
density function. Suppose that nothing known about f(x) except that 
continuous. Let independent observations The prob- 
lem setting tolerance limits can formulated follows: For some given 
and called tolerance limits, such that the probability that 


very satisfactory way when nothing known about f(x) except that con- 
tinuous. Wilks proposes the following solution: Let the observed 
values arranged order increasing magnitude. Then and 
where denotes positive integer. The exact sampling distribution 


Zn-—r+1 
the statistic derived Wilks and this provides the solution 


r 


for the problem setting tolerance limits. very important feature Wilks’ 


1 
solution the fact that the distribution entirely independent 


the unknown density function f(x), ie. the distribution 


the same for any arbitrary continuous density function 

this paper shall give extension Wilks’ method the multivariate 
case. bea set random variables with the joint probability 
density function Suppose that nothing known about 
independent observations drawn and the a-th observation denoted 
tie The problem setting tolerance limits 
for can formulated follows: For some given positive values 
ability that 


Mp M, 


Wilks, ‘‘Determination sample sizes for setting tolerance limits,’’ Annals 
Math. Stat., Vol. (1941). 
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holds, equal functions and are called the lower and upper 
tolerance limits x;. natural extension Wilks’ procedure would seem 
and denote some integers. However, this choice the tolerance limits 
does not provide satisfactory solution our problem, since the distribution 
(2) not independent the unknown density function 
will shown this paper that slight modification the above procedure 
the distribution (2) becomes entirely independent the unknown density 
function section will treat the bivariate case and 
section will extend the results multivariate distributions. 


The bivariate case. this section deal with the case when 
equality sign equal zero. define 


where and denote some positive integers and Consider only 
those sample points for which ie. consider 
magnitude. define 


will show that the distribution the statistic 


Moe 


the marginal distribution i.e. 


lated under the condition that M,. Hence 


+o My, 


(8) 


7 


3 
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and 
Me 

(9) 
2 


From (5), (8) and (9) follows that 
(10) PP. 


obvious that the distribution given Wilks’ formula. Since Wilks 
derived the distribution only when will briefly give here the 
derivation for any integers and 
Tir 


where restricted the interval [0, P]. Hence the distribution 
given 


1—P 


1 


the integral the density function over the range must 
equal find that 


Hence the probability density function given 


+ 1) n—8s,+r 


for any given values the conditional distribution given the 
for Hence the conditional distribution given 


ito 
lere 
nits 
tion 
lure 
ider 
sing 
4 


ABRAHAM WALD 


(Py? (1 dP. 


Since the expression (14) does not involve the quantities and distri- 
buted independently and M,. the joint density function 
and given the product (13) and (14), i.e. 


where denotes the product the constant coefficients (13) and (14). From 
(15) follows that the joint distribution and given 


Since the range the interval [Q, 1], the distribution given 


1 


(18) 


(1 rr (1 Q) | (1 aT. 


From (17) and (18) follows that the probability density function 
given 


or dQ. 


The multivariate case. assume that two elements the matrix 


event equal For each let the point with the 


the set all points for which 


Arrange the coordinates the points order increasing magnitude. 
Then equal the element and equal the s;-th element this 
ordered sequence. will derive the distribution 


Pp 1 
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Let 
+00 +00 Mj 


probability density function caleulated under the condition that 


M; 


obviously have 


will prove that the probability density function given 


(24) 


This certainly true 1,2. will assume that true for and 
dently distributed and that the probability density function given 


D D 8j—7j—l—sj4itrj 


The joint distribution and the same form the joint distribu- 
distribution given (19). Making the above substitution formula 
(19) obtain formula (24) Hence the validity (24) proved 


interesting note that the distribution does not depend the 
integers The construction the tolerance limits 
depends the order the variates practical applications the 
asymmetry the construction will very slight, since most practical cases 
If, for example, 1)/n .95, the tolerance limits will affected only 
desirable find construction which entirely independent the order 
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Tolerance regions composed several rectangles. For the sake sim- 
plicity will consider here the bivariate case. All results obtained 
section can extended without any difficulty the multivariate case. 

section the tolerance region has been single rectangle the plane 
variates and are strongly correlated, tolerance region rectangle shape 
seems unfavorable, since will cover unnecessarily large area the 
plane. The situation illustrated figure where the scatter 


The rectangle determined the vertical lines through and and 
the horizontal lines through the sample points with smallest and largest ordinate, 
restricting ourselves points which have abscissa values the interior the 
interval Similarly determined the vertical lines through 
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and and the horizontal lines through the sample points with largest 
and smallest ordinate, restricting ourselves points with abscissa values the 
interior Finally determined the vertical lines through 
and and the horizontal lines through the sample points with largest 
and smallest ordinate, restricting ourselves points whose abscissa values lie 
and much smaller area than the region will see later, the prob- 


T 


will consider tolerance regions the following general shape: Let 
where the size the bivariate sample. Let the vertical line the 
sample points which lie between the vertical lines and obviously equal 
Through each point which lies between the vertical lines 
horizontal lines Where the line above the line 
rectangle determined the lines Visi, Let region 
composed different rectangles R;;. The regions and the example 
illustrated figure are special cases the type regions described 


re 


Let the conditional distribution under the restriction that 


(28) 
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Let 


71,m; 


and 


where a;; the ordinate the lower corners and the ordinate the upper 
corners the rectangle and the summation taken over all values 


Let any random variable which has continuous probability density fune- 
Let the conditional density function under the condition 


(32) 


Um; gti 
where the summation taken over all pairs for which R;; contained 
Let 


Um; grits 


m;ti 


where the summation taken over all values for which contained 
T*. obviously have 


the joint distribution (ii) the distribution the same 
(31) and (33) that the distribution the same that P’. Now will 
derive the distribution P’. The expression can written the following 
form: 


Us; 
(34) 


4 
q 


same 
from 
will 
wing 


l—1 Us; Ys 
i=l Ur, Ys, 


(35) 


Us; 

For any fixed value y,,_, denote the conditional probability density 
under the restriction that and the conditional distribution 
under the restriction that Let 


Ur; 


Yr) 


Then follows from (34) and (35) that 


(36) 


For the distributions and may consider the variates 
which has the distribution dy. Hence, the distribution can de- 
rived from (13) and easy verify that the distribution the same 
independent and because (36) the distribution must 


the same way find that the distribution the same the distribu- 
tion 


Thus, induction see that the distribution the same the distribu- 

follows that the distribution given 


T(sr(n —8 + 1) Po (1 Po) dP». 


Hence, have proved that the distribution given (27). 
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Summary the results and numerical illustrations. shall give 
summary the results obtained and few illustrative examples. The multi- 
variate case being straightforward extension the bivariate case, 
discuss merely the latter. Consider pair random variables and 
y)dx the joint probability density function and and suppose 
that nothing known about f(x, except that continuous. sample 
variate population. The sample can represented points 
the plane y), being the point with the coordinates y;. 
have dealt with the problem finding rectangle the plane y), 
called tolerance region, such that can state with high probability, with 
probability .98 .99, that the proportion the bivariate universe included 
the rectangle not less than given number say not less than .98 
The rectangle constructed follows: Suppose that the points 
are arranged order increasing magnitude their abscissa values, 
vertical line V,, through p,, where and are positive integers such that 
consider the set consisting the points 
Which lie between the vertical lines V,, and drawa 
horizontal line H,, through the point which has the smallest ordinate 
Finally horizontal line drawn through the point which has 
the smallest ordinate The values and are positive integers for 
which The tolerance region the rectangle determined the lines 
Vs,, and H,,. The probability that least the porportion 


To n—s r P. 
and and are positive integers, then has the F-distribution 
variance distribution) with and degrees freedom. Thus, 


dom. From (37) follows that equal the probability that 


where has the analysis variance distribution with 2(n and 
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the following table gives the value the sample size which necessary 
for having the probability that least the proportion the universe 
included the tolerance rectangle 


Thus, want the probability .99 that the tolerance region will include 
least per cent the universe, the sample size must 499. 

section tolerance regions are considered which are composed several 
rectangles. Such tolerance region may more favorable than single rec- 
tangle and are highly correlated. illustration consider tolerance 
regions constructed follows: Suppose that divisible and the sample 
points are arranged order increasing magnitude their abscissa 
values. draw the vertical lines Vi, V2, and through the points 
the vertical lines and and the horizontal lines and where 
and are defined follows: consider only the points which lie between the 
two vertical lines and (points the vertical lines are excluded). From 
these select the point with the smallest and the point with the largest ordinate. 
The lines and are the horizontal lines which through these two points 
respectively. The tolerance region composed the four rectangles 
equal tos Thus, according the results section the prob- 


ability distribution the proportion the universe included the region 
given 


Numerical calculations show that 1000 the probability .99 that least 
98.1 per cent the universe will included the tolerance region 7*. 
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ASYMPTOTIC FORMULAS FOR SIGNIFICANCE LEVELS CERTAIN 
DISTRIBUTIONS 


PEISER 


Cornell University 


Introduction. The purpose this paper derive asymptotic formulas 
for the significance levels, per cent points, certain well-known statistical 
Although restrict ourselves here two distributions, those 
Chi-Square and Student’s will apparent that the methods used are 
applicable many other distributions well. 

The following results are obtained. Let the per cent point the 
normal distribution, that is, the distribution defined 


that 


and denote the per cent points the Chi-Square and 
distributions with degrees freedom respectively, then 


and 


These formulas approximate the true values and high degree 
accuracy. comparative values for several values and are given 
Section 

shall need the following theorem due Cramér [3, 81; see also pp. 
86-87]. 


THEOREM Let sequence independent, identically distrib- 
uted random variables having absolutely continuous distribution function with 
mean value zero, dispersion and finite fifth absolute moment. Let the 


r-th semi-invariant Then 


This problem was proposed the author Curtiss. 
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The Chi-Square distribution. random variable said distrib- 


uted according Chi-Square with degrees freedom its distribu- 
tion function 


The variable then has the distribution function 


write 


x20, 


that 


(2.4) 


and (1.2) that 


Fisher [1, 81] has suggested the use 


closer approximation, 


has been obtained Wilson and Hilferty interesting note that, 


according (1.3), this last approximation correct terms the zero-th 
order 


Then and, the additive property the Chi-Square distribution 
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(2.3) that 


(2.6) 


since Then (2.5) and (2.6) 


According (2.3) may now write 
where 
and simple change variables (2.1) yields 
n—1 


n 


let 
(2.9) 
then 
2.10 
where (1.2) and (2.4), 
(2.11) => + - Fon(x’>. 


Using Stirling’s formula for (2.8), (2.11) becomes 


ow 
P 
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where has the same given above for call these last 


P 
where polynomial. Since this last sum converges, have 


2.14 


since 


f 
and the uniform convergence (2.14), 
Integrating parts obtain 
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Further integration parts and the use (2.7) yields 
(2.17) 
Then, (2.10), (2.12), (2.13), (2.15), (2.16) and (2.17), 


that 


Pp 


Equation (1.3) now follows once. 
Student’s the random variable has the distribution function 


for degrees freedom and has the distribution function 


2), the variable then has the distribution function 


(3.1) 
write 

that 


and let then and follows from (1.1) and (1.2) 
that 


Hence 
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2), and the additive property the normal distribution, 


relation (3.1). Thus and 6n/(n 4). follows from (1.5) and 
(3.2) that 


since 0(1). (3.4) and (3.5) have 


3 


lim Nan » 
4 


and equation (1.4) follows once. 


Hence 


Tables. The following tables compare the true values and 
with those obtained from (1.3) and The true values [4], [5], (to three 
decimal places) are shown 


TABLE 
50.908 43.777 40.257 20.600 
135.807 124.342 118.498 99.334 
| | | 
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TABLE 
2.634 1.813 1.921 0.700 
2.298 2.000 1.670 1.161 0.679 
2.299 2.000 0.679 
2.270 1.980 1.658 1.156 0.677 
2.270 1.980 658 1.156 0.677 
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GENERALIZATION POINCARE’S FORMULA THE THEORY 
PROBABILITY 


Kar Lat 
Tsing Hua University, Kunming, China 


combination (without repetition) out (1, --- denote the probability 


where the summation extends all the combinations with members out 
(1, 
Then formula may written follows: 


equivalent formula is: 


The following conventions concerning the binomial coefficients are made: 


Two generalizations, possibly due Mises, are 


ee = — (t—m) 1 
Pn(1, n) ( 1) 1) S; 


notice that the probabilities appearing the left-hand sides these 
formulas are with respect the set suffixes (1, and the 
sums the right-hand sides are the same way. 

natural generalization let consider probability which symmetrical 
with respect certain sub-sets divide the events into 
sets: 


first set events exactly events occur; and out the second set events 

exactly events occur; and on; and out the rth set events 

exactly events occur. When this problem solved the analogous problem 


Von, ? Yrn, 
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which replace some the words ‘at least” can also 
solved. 


denote the required probability the left-hand side the following 
generalized Poincaré’s formula: 


ij=m, ig=m2 
12 


where 


the summation extending all those combinations a’s such that for every 

Proor: Let denote the probability the occurrence the events 
easily seen that 


n— 


where for fixed the second summation extends all the combinations 

Now let each each the right-hand side (1) decomposed into 
sum the the last-written way. Consider fixed 


1 2 r 


Hence, its total contribution the right-hand side (1) 
ni n2 Nr 
ig=m2 
otherwise 


> 
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Therefore after the decompositions and the collecting terms, the only p’s 
remaining the right-hand side (1) are those which for every 
Thus the right-hand side reduced 


where the summation extends all those combinations such that for 

just considered, replace our notation the square-bracketed un- 

before, noting that have 


and identity which can proved induction 

parallel generalization Poincaré’s formula follows: ask for the 
probability that ezther out the first set exactly events occur; out the 
second exactly finally, out the rth set exactly That is, in- 
stead repeated conjunctions may consider repeated disjunctions. 
denote the required probability the left hand side (2), then given 
terms the p’s defined above (1) the right-hand side below: 


Other events symmetrical with respect each the sub-sets, whose def- 
inition the words appear arbitrarily, may 
considered. 

Lastly, only mention that first application the formula (1) can used 
establish the formula 


first obtained its significance may refer [1], and con- 
tinuation that paper published shortly. 

REFERENCE 


trary Annals Math. Stat., Vol. (September 1941). 
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TABLES FOR TESTING RANDOMNESS GROUPING 


University Wisconsin 


When two different kinds objects are arranged along line they will form 
two more distinct groups like objects. Thus, the arrangement: aabbbab, 
there are a’s and b’s forming groups. general, there are objects 
one kind and objects another kind, there are all 


m+n m+n 
Cs 


different arrangements possible. There will loss generality assume 
that 


defined the number distinct groups like objects any one 
arrangement, then the proportion arrangements yielding less groups 


where 
and 


126’ 


The following tables have been prepared for use testing data for random- 
ness and for testing whether two samples are from the same population. Table 
inclusive whereas gives correct values for for .005, .01, 
.05, .95, .975, .99 and .995, where the largest integer, u’, for which 
ewhen .50, and the smallest integer, for which P{u w’} 
when .50. This table was obtained from Table and covers the same 


Eugenics, Vol. Part (1939) pp. 10-17). 
Wald and Wolfowitz, “On Test Whether Two Samples are from the Same Popu- 
(Annals Math. Stat., Vol. XI, No. June (1940) pp. 147-162). 
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TESTING RANDOMNESS GROUPING 


100. These values were obtained using the normal approximation given 
page 151 the Wald-Wolfowitz paper together with correction for con- 
tinuity not given their article—this correction improved the approximation 
Table III although they can obtained from Table order 
check the adequacy the approximation. These values obtained with the 
approximation check with those Table except for the five underscored 
values. appears that the approximation will adequate general for 
20. 

consider case where one might suspect nonrandomness and, more specifically, 
expect too few groups. The arrangement diseased and healthy plants 
row field might such case. For example, might have the following 
plant arrangement: 


where 


the number diseased plants present, 
20, the number healthy plants present, 
the number groups actually formed. 


From Table the probability associated with this arrangement found 
.018,3512, which the probability u’. Since .05, might elect 
regard this evidence tendency for the disease nonrandomly 
distributed among the plants row, knowing that look for explana- 
tion whenever P{u u’} .05 may expect follow false 
scent not more than one time twenty the long run. 

When control suggests the presence assignable causes variation 
manufactured product flowing from production line, examination 
various types runs, e.g. the lengths and relative frequency runs above and 
below the median sequence values, may assist diagnosing the nature 
the cause. Dr. Walter Shewhart has such instance: se- 
quence observations dealing with corrosion suggested the presence 
assignable cause variation. the use run charts assignable cause 
variation was tracked down the measuring apparatus and attempt was 
made eliminate it. The original sequence examined with regard runs 
above and below the median the sequence exhibited unexpectedly large 
number runs length more and result significantly low value 


American Defense Emergency Standards Z1.1 and Z1.2 entitled “Guide for Quality 
and “Control Chart Method Analyzing and American War Standard 
entitled Chart Method Controlling Quality During (pub- 
lished the American Standards Association, New York City). 
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FRIEDA SWED AND EISENHART 


and, the assignable cause were not completely eliminated the new design, 
might expect too large proportion long runs above and below the median, 
and, hence, too few total runs. sequence observations taken with the 
new measuring device yielded total runs above and below the median 
the sequence which significantly fewer than would expected arise under 
state statistical control, since 20, P{u 15} .038. This 
sequence special interest since the occurrence too few runs suggested 
the assignable cause had not been entirely eliminated although especially 
long runs, say length more, occurred this sequence, that from the 
point view length runs without regard their number the assignable 
cause might have been judged have been eliminated. 

instance where too many groups would the probable alternative 
randomness consider the arrangement occupied and unoccupied seats 
lunch counter about half hour before the popular lunch hour begins. 
such case the critical region would and the appropriate probability 


would P{u 1}. Sucha situation was observed and yielded 


the following arrangement empty and occupied seats along the lunch counter: 


11, 


and though this probability not quite significant, the arrangement observed 
has the maximum number groups empty and occupied seats for the and 
the size observed since two occupied seats are adjacent. However, 
another customer had entered and sat either the 5th empty seat from the 
left the 8th empty seat, the number groups would have been increased 
two and the situation would be: 


10, 
13, 


This value significant, and for this assumed case, well for the actual 
case observed, the arrangement E’s and O’s has the maximum number 
groups like objects. Certainly both these cases exhibit too many groups 
considered random arrangements. 

The use these tables test whether two samples constitute independent 
random samples from the same can illustrated using the data 
Snedecor’s Example 4.11 page his Statistical Methods (3d edition) 


and Wolfowitz (ibid) have pointed out that exceptionally small values 
are regarded evidence for rejecting this null hypothesis. 
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TESTING RANDOMNESS GROUPING 


which gives daily gains two lots steer calves two different rations. The 
daily rates gain given for the two lots are: 


2.17, 2.06, 2.11, 2.24, 2.52, 2.04, 1.95; 
1.82, 1.85, 1.87, 1.74, 2.04, 1.78, 1.76, 1.86. 


Arranging these rates order magnitude, designating calf ration 


(1.74), (1.76), (1.78), (1.82), (1.85), (1.86), (1.87), 1.95, 1.95, 
(2.04), 2.04, 2.06, 2.11, 2.17, 2.24, 2.52. 


Whence 
.008,8578. 


Accordingly, either the .05 .01 level significance rejection the null 
hypothesis that the two samples constitute independent random samples from 
the same population indicated. 

For these data note the fact that having two identical values, i.e. 2.04, 
the two lots did not alter the number groups regardless whether they were 
recorded (2.04), 2.04 2.04, (2.04). However, such duplications general 
may more bothersome, since they may yield different values depending 
the order which they are considered. such instances both possible 
orders should considered. 

The merit this test that employs minimum assumptions—merely 
that the common population continuous, and that the samples drawn 
random independently. Its principal defect its lack power. conse- 
quence gross disparity between the samples generally required render 
Therefore, when additional assumptions are tenable, tests utilizing 
them should employed. 


Most the computing and checking these tables was done Frieda 
Swed, Philip Ritz and Beatrice Kelley with some assistance from Jay Grod- 
man, Edward Halamka and Mrs. Henry Wallman. Also, Duane Borst and 
Francis Cox helped with the typing and the proofing the tables. 
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TABLE (Concluded) 


n=1g 
20 +[2130,0916 .212,9756 o314,2734 .433,0908 


2 


2874, 4085 


8070 


0965 


-999,9999 
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e=19 


TESTING RANDOMNESS GROUPING 


DW 


DODD 


«OM, 


NUNN 
NUNN 


NUNN 


2 2 6 S 4 S S 


N JO 


7 
001 NOCH HHH 
i 3 
fe) 
a 
Dh | 
o 
7 


FRIEDA SWED AND EISENHART 


TABLE (Concluded) 
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,0280 


3070 


2948, 4301 
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TABLE 
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NOTES 


This section devoted brief research and expository articles, and notes 
methodology. 


NOTE THE BEST LINEAR ESTIMATE 


University Iowa 


Introduction. Let the chance variable subject the distribution 
function D(x) and usual let denote the mathematical expectation 
linear estimate E(x) when the c’s are chosen that E(x), and 
estimate best the sense defined, yet such that, being another estimate, 


The rectangular distribution. Consider D(x) 1/a,0 and let 
the items each sample arranged ascending order magnitude that 


Thus 


and 


From E(y) E(x) a/2, have 
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BEST LINEAR ESTIMATE 


homogeneous linear equations unknowns: 


Since the determinant the coefficients not zero, the solution 
where the largest item the sample. 
The distribution function readily found 


n+1 


4n(n 

has long been that the sampling distribution the 
where and are respectively the smallest and largest items 
samples size from rectangular population, has smaller variance than 
does that the arithmetic mean all items. The distribution function 


From this, follows that 


n—1 
n—1 


Yet Pittman has recently proved that for every 
exceeds the probability that any other estimate, including will fall 
this interval length about the mean a/2. 


Cy 


then the limits and D(v) 


u-l 


Soc. London, Series Vol. 222 (1921), pp. 309-368. 
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EDWARD PAULSON 


Thus neither nor has asymptotic normal distribution. 
is, course, this fact which makes the criterion minimum variance illusory. 


Other polynomial distribution functions. Let repeated samples in- 
dependent values drawn from population characterized D(x) 
k+1 


where before the largest item the sample. The sampling distribution 
easily obtained. follows that 


where the arithmetic mean the sample. Again, write 


NOTE TOLERANCE LIMITS 


Epwarp 


Columbia University 


Among various statistical problems arising the process controlling quality 
mass production, rather important one appears the determination 
tolerance limits when the variability the product known due ran- 
dom factors. This problem was recently treated pioneer article Wilks. 
This note will point out relationship between tolerance limits and confidence 
limits (used the sense Neyman), and will use this concept establish 
tolerance limits when the product described two qualities, the measure- 
ments which are assumed have bivariate normal distribution. 

For the case single variate, the problem finding tolerance limits 


future observation falling the random variates and then 

The relationship between confidence limits and tolerance limits will arise 


confidence limits are determined, not for parameter the distribution, but for 


Work done under grant-in-aid from the Carnegie Corporation New York. 
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TOLERANCE LIMITS 


future random observation (or for some function the observations future 
independent sample). This based the following simple lemma: confidence 


U1 


respectively, then the definition expected value 


This triple integral however exactly the probability that will lie between 
and which the nature confidence limits must equal which proves 
the lemma. manner follows that the basis given sample 
from future sample, and denotes the probability that all fall 
the confidence region, then E(P) repeated sampling equals establish 
tolerance limits, necessary addition E(P) also know the distribution 

appears, least intuitive basis, that the confidence interval 
can used determine the shape the ‘‘most efficient” tolerance limits; this 
intuitive notion will gain additional support from the character the tolerance 
region which will now derived for observation (x, from distribution 
with probability density y), where 


Suppose have independent samples 


both from f(x, y). Then known that 


i 1 


i=1 


Hotelling’s Generalized Student Ratio confidence region for future 

denotes the probability future observation falling this ellipse, then 


dxdy. utilizing the fact [2] that invariant under linear 
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EDWARD PAULSON 


transformations, not difficult see that the distribution will not in- 
volve any unknown parameters, its distribution can under the 


know that E(P) and will now calculate the variance ex- 
panding Taylor Series (to terms the first order) about the point 


Taking derivatives and evaluating about the population values 
n+1 
n+l 
Since for ordinary values .95 .99) the distribution seems 


approach normality very slowly, will follow suggestion Wilks and sup- 
pose that fairly close approximation the distribution will given 


T(u 


(1) 


/ 
~ 


= 


THE CHI-SQUARE DISTRIBUTION 
2 2 2 


This distribution can now used establish tolerance limits. For example, 
follows from (1) that for sample size 214, and tolerance region given 
the ellipse 9.21, then .99 and the Prob.{.985 .995} 

Care must taken the use these and similar results, for the distribu- 
tion not bivariate normal one, large error may introduced which will 
not eliminated with increasing however the error probably small 
when tolerance region found for the means future sample obser- 
vations 20) contrasted with tolerance region for single observation. 
exact treatment the case when the bivariate distribution unknown has 
been given Wald the present issue the Annals Mathematical Statistics. 


REFERENCES 
‘‘Determination sample sizes for setting tolerance Annals 
Math. Stat., Vol. (1941), pp. 91-96. 


Vol. (1931), pp. 360-378. 


NEW APPROXIMATION THE LEVELS SIGNIFICANCE 
THE CHI-SQUARE DISTRIBUTION. 


AROIAN 


Hunter College 


Recent articles the percentage points the distribution [1], [2], have 
distribution [3], method particularly useful and easily computed for large. 


the measure skewness for the distribution, the following formulas give sig- 
and were found the usual method least squares, fitting each formula 
the values [4] for +0.1, +0.2, +0.3, and +0.4. Then the value 
each instance was adjusted give the proper value when e.g. 
the constant term the method least squares for the per cent point 
50, but the formulas are quite satisfactory for 30. Formulas for when 


[3] are easily derived, but such results while more accurate the range 
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Second value correct result. 


THE CHI-SQUARE DISTRIBUTION 


The maximum error for the range a3| .4, the fourth significant 
respectively for the .01%, .1%, 1%, 2.5%, 5%, 10%, 20%, 25%, 30%, 
40%, and 50% points respectively. The error increases outside the indicated 
range. addition 


and similarly for other percentage points. These are obtained from (1) re- 

compare results obtained these methods against those Wilson and 
Hilferty all cases except the 95% level the method here proposed 
superior. Table compares the two methods. was copied from [2] except 
for the corrections the Wilson and Hilferty method for the 95% level and 
the accurate value for the level 75, 96.2160 place 96. 11. 
Table gives comparisons for other levels when 30. 


REFERENCES 

32, Part 

Biometrika, Vol. 32, Part 

Annals Math. Stat., Vol. (1941). 

[4] Satvosa, Pearson’s Type III Annals Math. Stat., 
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NEWS AND NOTICES 


Readers are invited submit the Secretary the Institute news items 
general interest 


Personal Items 


Associate Professor Evans the Mathematics Department the Uni- 
versity Wisconsin has been promoted professorship. 

Assistant Professor Willy Feller the Mathematics Department Brown 
University has been promoted associate professorship. 

Dr. Carl Kossack the Mathematics Department the University 
Orégon has been promoted assistant professorship. 

Dr. Eugene Lukacs has been appointed assistant professorship the 
Mathematics Department Illinois College. 

Professor Mode has been made chairman the Mathematics Depart- 
ment Boston University. 

Mr. Charles Mummery has been made Product Quality Engineer the 
Scioto Ordinance Plant the Rubber Company. 

Professor Rietz has retired after twenty-five years service Head 
the Mathematics Department the University Iowa. 


The Foundation for the Study Cycles has announced that medal will 
awarded the individual making the most significant contribution cycle re- 
search during 1943. Communications should addressed to: Professor Ells- 
worth Huntington, Yale University, New Haven, Connecticut. 


Obituary 


Professor Edward Dodd the Mathematics Department the University 
Texas died January 1943 the age sixty-seven years. was 
charter member the Institute. was elected one the Vice-Presidents 
the Institute for 1943. His contributions mathematical statistics consist 
numerous research papers general mean statistical 
variables; and statistical theory periodicities. 


Stanford Courses Statistical Methods Quality Control 


novel procedure adult education, and particularly statistical education, 
took place last summer Stanford University, when courses the Shewhart 
statistical methods quality control were offered short intensive courses. 
There were two courses, one the campus Stanford University, July 
and the other Los Angeles, September 20-27. The first course covered ten 
full days, and the second eight. Both courses ran eight hours per day, Satur- 
days and Sundays included. 
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The features the course may described the following points: 

The courses were short, thus making possible for men industry 
attend. 

hours’ instruction was sufficient cover the field adequately. 

The instruction covered wide range points view. 

The students were picked delegates sent industry. 

The courses are being followed with monthly meetings Los Angeles and 
San Francisco. 

Intensive courses this character were first suggested Dr. Edwards 
Deming April 1942, while was temporarily detailed the office the 
Chief Ordnance the War Department, and the first course actually com- 
menced just three months later. giving the course men already in- 
dustry, the yield obtained was manyfold higher than can expected from 
regular college course. Contributions and reports made the delegates sub- 
sequent the courses supply abundant foundation for this statement. 

West Coast industry and the Army and Navy ordnance districts sent dele- 
gates the West Coast, and the second. Through the efforts Professor 
Eugene Grant Stanford, industry and the Army and Navy were persuaded 
send some their mést valued officials. The instruction was organized 
Professor Holbrook Working. Both and Professor Grant took active part 
the instruction, which was supplemented both courses Dr. Edwards 
Deming exponent government and industrial sampling. the first 
course, Mr. Charles Mummery the Hoover Company served in- 
structor from the viewpoint industry. the second course (the one Los 
Angeles), Mr. Ralph Wareham the General Electric Company occupied the 
industrial corner the square instruction. The expense the instructors 
was paid out ESMWT funds (Office Education). Montly follow-up courses 
San Francisco and Los Angeles, under the direction Professors Working and 
Grant, supply the necessary power for maintaining momentum, and for gathering 
the men together for directed study and consultation. 

The demand for men trained this line far exceeds the supply, and there are 
movements afoot provide similar courses number industrial cities. 
Three-day courses dozen more key ordnance cities were held last fall 
the Ordnance Department. The lecturers were Messrs. Edwards and 
Harold Dodge the Bell Telephone Laboratories, and Mr. Rupert Gause 
the Aberdeen Proving Ground, now with the Army Ordnance Washington. 
These courses and the Stanford courses alleviated the situation considerably, 
further instruction needed. 


Junior Membership the Institute 


the annual election for 1942 which was held mail ballot because the 
postponement the Annual Meeting, constitutional amendments were approved 
which created new grade membership the Institute, known Junior 


REPORT PRESIDENT 


Membership. hoped that this provision for Junior Membership will 
stimulate interest mathematical statistics the advanced undergraduate 
level colleges and universities. 

The Board Directors have approved the following rules governing Junior 
Membership: 

Any undergraduate student collegiate institution eligible for election 
Junior member the Institute Mathematical Statistics provided 
that she sponsored member the Institute. 

The annual dues ($2.50) must submited with the application. 

Annual membership shall coincide with the calendar year and the Junior 
Member shall receive complete volume the Annals Mathematical 
Statistics for the year which she elected. 

Junior Membership shall limited term two years, but Junior 
Member may apply for transfer ordinary membership the beginning 
his second year. 

For the convenience any Institute member who may wish sponsor 
Junior Member application blank provided the back this issue the 
Annals. Additional blanks may obtained from the Secretary the Insti- 
tute. 


Announcement May Meeting New York 


There will joint meeting between the Institute and the American Society 
Mechanical Engineers Saturday, May 29, 1943, the Engineering Societies 
Building, West 39th Street, 

The meeting will consist two sessions industrial applications mathe- 
matical statistics. The topics are follows: 

Morning Session, A.M. 
Chairman: Harold Hotelling 
Wolfowitz, the Theory Runs with some Quality Control. 
Churchill Eisenhart, the Presentation Data Evidence. 
Afternoon Session, P.M. 
Chairman: Shewhart 
Dodge, Sampling Inspection Plan for Continuous Production. 
Young, Tolerances and Product Acceptability. 


ANNUAL REPORT THE PRESIDENT THE INSTITUTE 


Ordinarily the business meeting and the luncheon customarily held 
part the annual meeting the Institute, the President has the opportunity 
make public acknowledgement those individuals, aside from the officers, who 
served the Institute during the year, and have his say concerning past progress 
and future plans. This year appears that the pages the Annals must 
used for this purpose. 
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the result proposals made and approved our last regular annual meet- 
ing New York City, larger number special committees than usual were ap- 
pointed for 1942 and thus more members than before were specifically asked 
participate the affairs the Institute. The Institute much indebted 
these individuals for the way which they responded. 

Professors Craig, Harold Hotelling, and Wilks, Chairman, consti- 
tuted committee study the Board Directors the Institute and the 
formal connection between the Institute and its journal, The Annals Mathe- 
matical Statistics. Their recommendations were incorporated amendments 
the constitution and by-laws recently approved the Institute. The Board 
was increased size and given greater continuity including the two 
previous presidents, and the editor the Annals, officio, and increasing 
the term the Secretary-Treasurer three years. 

The new class junior memberships the result study this question 
committee composed Professors Bushey, Boyd Harshbarger, and 
Snedecor, Chairman. Regulations, since approved the Board, under which 
local chapters the Institute may formed, were drawn committee 
consisting Dr. Kossack, Professor Larsen, and Professor 
Camp, Chairman. 

Dr. Aroian, Dr. Daly, Mr. Dodge, and Professor Baten, 
Chairman, committee agreed assist the Institute endeavoring bring 
the Annals Mathematical Statistics the favorable attention municipal, 
industrial, and college libraries which had not been subscribers it. result 
their fine work, good number domestic libraries has been added our sub- 
scription list, thus serving counterbalance our losses abroad. 

The Program Committees for the year consisted Professor Churchill 
Eisenhart and Mr. Molina, Chairman, for the September meeting Pough- 
keepsie, New York, and Professor Dwyer and Dr. Deming, chair- 
man, for the projected Cleveland meeting. The Institute always much in- 
debted those who the work arranging its programs for meetings, but this 
year owe Dr. Deming special acknowledgement, who prepared excellent 
program for Cleveland, then one for New York meeting under extremely short 
notice when the Cleveland meeting was cancelled, and then had that meeting also 
cancelled. Van Voorhis acted our representative the Committee 
Local Arrangements for the meeting planned for Cleveland. 

The membership committee appointed for 1942 was made Dr. 
Deming, Professor Dodd, and Professor Craig, chairman. After 
Professor Craig took his commission the Navy, Professor Camp 
agreed take his place this committee. 

For some years Professor Craig has generously acted custodian our 
files back numbers the Annals. This service the Institute has been 
taken over Professor Knowler, who has been much assistance. Dr. 
Blanche did considerable amount work connection with finding 
advertisers for the Annals. 
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Our annual meetings had been increasingly successful recent years and 
was real sacrifice for the Institute forego the one planned for 
the war demonstrating still more ways and places the importance sound 
statistical methods, for the present imposes serious responsibilities the 
friends the Institute. reading the report our faithful and efficient 
Secretary-Treasurer will amplify this statement. the present Board, Pro- 
fessors Olds, Wilks and Craig met Pittsburgh January and consider 
some our problems. Though there seems prospect national meeting 
the coming year, hoped that some local meetings can held and that 
other ways can keep the activities the Institute. particular there 
exists the opportunity organizing local chapters the Institute the larger 
centers which would particularly valuable now. industrial areas may 
contribute the war effort well promote important aspect mathe- 
matical statistics endeavoring useful the development and application 
industrial statistics. clear that the Institute needs the loyal support 
its membership now much ever before fulfill the for which 
was founded. 


President. 
December 31, 1942. 


ANNUAL REPORT THE SECRETARY-TREASURER THE 
INSTITUTE 


September 8-9 the Institute met Vassar College conjunction with the 
American Mathematical Society and the Mathematical Association America. 
Mr. Molina and Professor Churchill Eisenhart were charge the pro- 
gram. Fifty eight members the Institute attended the meeting. The Annual 
Meeting, originally scheduled for Cleveland then transferred New York City, 
was finally postponed the request the Office Defense Transportation. 
the present time seems that this meeting will have abandoned entirely 
and the Institute must content with holding local meetings some the 
larger cities. 

Because the postponement the Annual Meeting, the annual election was 
held mail. The following officers were elected: Professor Cecil Craig, 
President; Professors Edward Dodd and Abraham Wald, Vice-Presidents; 
and Professor Edwin Olds, Secretary-Treasurer. Nine amendments the 
Constitution and six amendments the By-Laws were proposed and accepted 
two-thirds majority those voting. Professor Fetters acted teller. 

During the past year the Secretary has cooperated with industrial concerns and 
government agencies locating statistically trained personnel fill positions 
created the emergency. Members the Institute are requested keep the 
Secretary informed regarding the availability such personnel. 

The death one member the Institute has been reported since the last 
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annual meeting—Dr. Robert Henderson, former Vice-President and Actuary 
the Equitable Life Assurance Society. 

The following financial statement covers the period from December 10, 1941 
December 10, 1942 (the books and records the Treasurer have been audited 


Mr. George Niver and found agreement with the statement sub- 
mitted): 


FINANCIAL STATEMENT 
December 10, 1941, December 10, 1942 


EXPENDITURES 
ANNALS OFFICE 
Back NuMBERS OFFICE 
Purchase back numbers from $355.77 
Reprinting 300 copies Vol. No. 142.16 
$497 .93 
SECRETARY-TREASURER’S OFFICE 


comparison with the financial condition the Institute the end 1941, 
the receipts from dues, subscriptions, and sales back numbers have increased 
more than $800. This mostly due large increase the sales back num- 
bers and net increase fifty members. The increase expenditures the 
Institute was accounted for the increased cost printing the Annals. This 
marks the beginning trend which seems likely continue throughout the war. 
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would seem over-optimistic expect that the financial situation the 
Institute would continue show marked improvement 1943. Present indi- 
cations suggest that shall very fortunate avoid considerable deficit 
operations. large number our foreign subscribers have not renewed and 
face considerable difficulty delivery the Annals those still force. The 
large increase the sales back numbers was due rather successful effort 
persuade domestic libraries provide themselves with complete sets back 
numbers while the issues were still available. The Institute faces increase 
operating expenses and advance the cost producing the Annals. The 
full cooperation all members needed are avoid decrease the work 
the Institute during 1943. 


Secretary-Treasurer. 
December 31, 1942. 


behalf the Board Directors the Institute, regret announce the 
sudden death Vice-President Dodd, January 1943, shortly after 
this report was written. Dr. Deming was appointed the Board 
Directors fill the vacancy created Vice-President Dodd’s death. 


CONSTITUTION 
THE 
INSTITUTE MATHEMATICAL STATISTICS 


ARTICLE 


NAME AND PURPOSE 


This organization shall known the Institute Mathematical Statistics. 
Its object shall promote the interests mathematical statistics. 


ARTICLE 


MEMBERSHIP 


The membership the Institute shall consist Members, Junior Members, Fellows, 
Honorary Members, and Sustaining Members. 

Voting members the Institute shall (a) the Fellows, and (b) all others, Junior 
Members excepted, who have been members for twenty-three months prior the date 
voting. 

person shall Junior Member the Institute for more than limited term 
determined the Committee Membership and approved the Board Directors. 


ARTICLE III 


The Officers the Institute shall President, two Vice-Presidents, and Secre- 
tary-Treasurer. The terms office the President and Vice-Presidents shall one year 
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and that the Secretary-Treasurer three years. Elections shall majority ballots 
Annual Meetings the Institute. Voting may person mail. 

(a) Exception. The first group Officers shall elected majority vote the in- 
dividuals present the organization meeting, and shall serve until December 31, 1936. 

The Board Directors the Institute shall consist the Officers, the two previous 
Presidents, and the Editor the Official Journal the Institute. 

The Institute shall have Committee Membership composed three Fellows. 
their first meeting subsequent the adoption this Constitution, the Board Di- 
rectors shall elect three members Fellows serve the Committee Membership, 
one member the Committee for term one year, another for term two years, 
and another for term three years. Thereafter the Board Directors shall elect from 
among the Fellows one member annually their first meeting after their election for 
term three years. The president shall designate one the Vice-Presidents Chairman 
this Committee. 


ARTICLE 
MEETINGS 


meeting for the presentation and discussion papers, for the election Officers, 
and for the transaction other business the Institute shall held annually such 
time the Board Directors may designate. Additional meetings may called from 
time time the Board Directors and shall called any time the President 
upon written request from ten Fellows. Notice the time and place meeting shall 
given the membership the Secretary-Treasurer least thirty days prior the date 
set for the meeting. All meetings except executive sessions shall open the 
Only papers accepted Program Committee appointed the President may pre- 
sented the Institute. 

The Board Directors shall hold meeting immediately after their election and 
again immediately before the expiration their term. Other meetings the Board may 
held from time time the call the President any two members the Board. 
Notice each meeting the Board, other than the two regular meetings, together with 
statement the business brought before the meeting, must given the members 
the Board the Secretary-Treasurer least five days prior the date set therefor. 
Should other business passed upon, any member the Board shall have the right 
reopen the question the next meeting. 

The Committee Membership shall hold meeting immediately after the annual 
meeting the Institute. Further meetings the Committee may held from time 
time the call the Chairman any member the Committee provided notice such 
call and the purpose the meeting given the members the Committee the 
Secretary-Treasurer least five days before the date set therefor. Should other business 
passed upon, any member the Committee shall have the right reopen the question 
the next meeting. 

Ata regularly convened meeting the Board Directors, four members shall con- 
stitute quorum. convened meeting the Committee Membership, 
two members shall constitute quorum. 


ARTICLE 


PUBLICATIONS 


The Annals Mathematical Statistics shall the Official Journal for the Institute. 
The Editor the Annals Mathematical Statistics shall Fellow appointed the 
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Board Directors the Institute. The term office the Editor may terminated 
the discretion the Board Directors. 
Other publications may originated the Board Directors arises. 


ARTICLE 
EXPULSION SUSPENSION 


Except for non-payment dues, one shall expelled suspended except 
action the Board Directors with not more than one negative vote. 


ARTICLE VII 


AMENDMENTS 


This constitution may amended affirmative two-thirds vote any regularly 
convened meeting the Institute provided notice such proposed amendment shall have 
been sent each voting member the Secretary-Treasurer least thirty days before the 
date the meeting which the proposal acted upon. Voting may person 
mail. 


BY-LAWS 
ARTICLE 


BERSHIP 


The President, his absence, one the Vice-Presidents, the absence the 
President and both Vice-Presidents, Fellow selected vote the Fellows present, shall 
preside the meetings the Institute and the Board Directors. meetings the 
Institute, the presiding officer shall vote only the case tie, but meetings the 
Board Directors may vote all least three months before the date the 
annual meeting, the President shall appoint Nominating Committee three members. 
shall the duty the Nominating Committee make nominations for Officers 
elected the annual meeting and the Secretary-Treasurer shall notify all voting members 
least thirty days before the annual meeting. Additional nominations may sub- 
mitted writing, signed least ten Fellows the Institute, the time the 
meeting. 

The Secretary-Treasurer shall keep full and accurate record the proceedings 
the meetings the Institute and the Board Directors, send out calls for said meetings 
and, with the approval the President and the Board, carry the correspondence the 
Institute. Subject the direction the Board, shall have charge the archives and 
other tangible and intangible property ofthe Institute, and once year shall publish 
the Annals Mathematical Statistics classified list all Members and Fellows the 
Institute. shall send out calls for annual dues and acknowledge receipt same; pay 
all bills approved the President for expenditures authorized the Board the Insti- 
tute; keep detailed account all receipts and expenditures, prepare financial statement 
the end each year and present abstract the same the annual meeting the 
Institute after has been audited Member Fellow the Institute appointed the 
President Auditor. The Auditor shall report the President. 

Subject the direction the Board, the Editor shall charged with the responsi- 
bility for all editorial matters concerning the editing the Annals Mathematical Sta- 
tistics. with the advice and consent the Board, appoint Editorial Commit- 
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tee not less than twelve members co-operate with him; four for period five years, 
four for period three years, and the remaining members for period two years, ap- 
pointments made annually needed. All appointments the Editorial Com- 
mittee shall terminate with the appointment new Editor. The Editor shall serve 
editorial adviser the publication all scientific monographs and pamphlets authorized 
the Board. 

The Board Directors shall have charge the funds and the affairs the In- 
stitute, with the exception those affairs specifically assigned the President the 
Committee Membership. The Board shall have authority fill all vacancies in- 
terim, occurring among the Officers, Board Directors, any the Committees. The 
Board may appoint such other committees may required from time time carry 
the affairs the Institute. 

The Committee Membership shall prepare and make available through the Secre- 
tary-Treasurer announcement indicating the qualifications requisite for the different 
grades membership. 


ARTICLE 


Members shall pay five dollars the time admission membership and shall receive 
the full current volume the Official Journal. Thereafter, Members shall pay five dol- 
lars annual dues. The annual dues Junior Members shall two dollars and fifty cents. 

The annual dues Fellows shall five dollars. annual dues Sustaining Members 
shall fifty dollars. Honorary Members shall exempt from all dues. 

(a) Exception. the case that two Members the Institute are husband and wife 
and they elect receive between them only one copy the Official Journal, the annual 
dues each shall three dollars and seventy-five cents. 

Annual dues shall payable the first day January each year. 

The annual dues Fellow, Member, Junior Member include subscription the 
Official Journal. The annual dues Sustaining Member include two subscriptions 
the Official Journal. 

shall the duty the Secretary-Treasurer notify mail anyone whose dues 
may six months arrears, and accompany such notice copy this Article. 
such person fail pay such dues within three months from the date mailing such notice, 
the Secretary-Treasurer shall report the delinquent one the Board Directors, whom 
the person’s name may stricken from the rolls and all privileges membership with- 
drawn. Such person may, however, re-instated the Board Directors upon pay- 
ment the arrears dues. 


ARTICLE III 


SALARIES 


The Institute shall not pay salary any Officer, Director, member any com- 
mittee. 


ARTICLE 


AMENDMENTS 


These By-Laws may amended the same manner the Constitution 
majority vote any regularly convened meeting the Institute, the proposed amend- 
ment has been previously approved the Board Directors. 


: 
; 


